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TRANSFORM SPARSE ARRAY MATRIX CODE TO PERFORM 
RUN TIME DEPENDENCY CHECK 

I \20 



SOFTWARE-PIPELINE THE TRANSFORMED SPARSE ARRAY MATRIX CODE 



2Y0 



FORM A PREDETERMINED NUMBER OF VARIABLES BASED ON A 
VIRTUAL UNROLLING FACTOR M 



INITIALIZE THE FORMED PREDETERMINED NUMBER OF VARIABLES 
BY CREATING VARIABLES bO, bl, ... bM-1 AND aO, al, ... a M-1 
AND INITIALIZING bO, bl, ... bM-1 TO AN ILLEGAL VALUE FOR 
THE b ARRAY, SAY TO -1 



LOAD PRIOR COMPUTED VALUES SUCH THAT b[i] IS LOADED INTO 
THE VARIABLE bM AND THE VALUE OF c[i] IS LOADED INTO THE 

VARIABLE cM 



I 



ASSIGN THE PRIOR COMPUTED VALUES, INSIDE THE LOOP BODY, 
TO A PREDETERMINED NUMBER OF REGISTERS 
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m 



UNPIPELINED (17 CYCLES PER ITERATION) 

TIME - > 0123456789012345678901234567890123456788901234567890123456789 

ITER01: L A S 

ITER02: L A S 

ITER03: L A S 

ITER04: L— - 



UNROLLED (9 CYLES PER ITERATION) 

TIME > 0123456789012345678901234567890123456788901234567890123456789 

ITER01: L A S 

ITER02: L A S 

ITER03: L A S 

ITER04: L A S 

ITER05: L A S 

ITER06: L A S 



SOFTWARE PIPELINED (5 CYLES PER ITERATION) 
TIME-> 0123456789012345678901234567890123456788901234567890123456789 



ITER01 
ITER02 
ITER03 
ITER04 
ITER05 
ITER06 



L A S 

L A S 

L A S 

L-- A S 

L A S 

L A S 
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ITERATIONS 



STAGE 1 




CYCLE 0: b3 = LOAD b[i] 

.7 _ 1 A»n „r:1 

CJ = LUAU qij 

CYCLE 1: 

CYCLE 2: COMPUTE &(a[b(i)]) 




cmp.ne pl,p4 = b3, b2 
CYCLE 3: a3old = a[b(i)J 
CYCIF 4* cmnneuncDl = 


b3,b1 


CTA^r 0 
blAGL I 




uiLt d: (pi J cmp.ne.unc pi ,pz - 


Dj.DU 


CYCLE 6: 

rvf*i c "7. 

CYCLL /: 




CYCLE 8: 
CYCLE 9: 




SIAGL j 




CYCLE 10: (p2J ao = aO + cj 




CYCLE 11: (p3) a3 = al + c3 
CYCLL 12: (pi) a J = a3o!d t cj 




CYCLE 13: (p4) a3 = o2 + c3 
CYCLE 14: 




STAGE 4 




CYCLE 15: 




CYCLE 16: 
CYCLE 17: 




CYCLE 18: 
CYCLE 19: 




STAGE 5 




CYCLE 20: 




CYCLE 21: store a[b(i)] = a3 - 
CYCLE 22: ( 

CYCLE 23: 
CYCLE 24: 
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STAGE 1 




CYCLE 0: b3 = LOAD b[i] 

c3 = LOAD c[i] 
CYCLE 1: 

CYCLE 2: COMPUTE &(a[b(i)]) 




. cmp.ne pl,p4 = b3, b2 
CYCLE 3: a3old = a[b(i)] 
CYCLE 4: (pi) cmn.ne.unc.p1,p3 = 


b3,b1 


STAGE 2 




CYCLE 5: (pi) cmp.ne.unc p1,p2 = 


b3,b0 


CYCLE 6: 
CYCLE 7: 




CYCLE 8: 
CYCLE 9: 




STAGE 3 




CYCLE 10: (p2) a3 = aO + c3 




CYCLE 11: (p3) a3 = al + c3 
CYCLE 12: (pi) a3 = a3old + c3 
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PROCESSING 
UNIT 



NETWORK 
INTERFACE 



BUS 



VOLATILE 
MEMORY 



NON-VOLATILE 
MEMORY 



PROGRAM 



REMOVABLE 
STORAGE 



COMMUNICATION 
CONNECTION 



5U 



NON-REMOVABLE 
STORAGE 



INPUT 



OUTPUT 



